THE EXPECTED NUMBER OF INVERSIONS 
AFTER n ADJACENT TRANSPOSITIONS 

MIREILLE BOUSQUET-MELOU 

Abstract. We give a new expression for the expected number of inversions in the product 
of n random adjacent transpositions in the symmetric group S m +i. We then derive from this 
expression the asymptotic behaviour of this number when n = n m scales with m in various 
ways. Our starting point is an equivalence, due to Eriksson et al., with a problem of weighted 
f~ "* ) walks confined to a triangular area of the plane. 
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1. Introduction 



Let &m+i denote the group of permutations on the set {0, 1,2,..., m}. Define the adjacent 

transposition Si to be the two-cycle (i,i + 1). We consider the adjacent transposition Markov 

chain t:^, tt( 2 \ . . . on 6 m +i. That is, n^°> is the identity, and 7r( n+1 ) = 7P™'si, where 

' i G {0, . . . , m — 1} is chosen uniformly at random. For instance, denoting 7r = ttqiti ■ ■ ■ 7r m if 

. 7r(z) = TTi for all i, we may have, when m = 3. 

> . 

7r<°> = 0123 = id, 
ttW = 0213 = 7r(°) Sl , 
ttW = 2013 = ttWsq, 
(N ■ ir^ = 2103 = tt( 2 ) Si . 

> . 

C*~) , Markov chains on finite groups have attracted a lot of interest in, at least, the past 30 years, at 

the interface of algebra and probability theory Q~| [TT| [TH [T51 [23 [27J . Such chains are also studied 
in computational biology, in connection with models of gene mutations [21 EH EH [201 [55] • A 
central question is to estimate the mixing time of the chain, for large values of m. (The mixing 
time is the number of steps the chain takes to approach its equilibrium distribution.) The 
above chain is periodic of period 2, since Tv^ 2n ^ is always in the alternating group. Thus it has 
no equilibrium distribution. However, an elementary modification (which consists in choosing 
^.(ra+i) _ ^(n) w j^ n probability l/(m+ 1) and otherwise multiplying by a transposition Sj chosen 
uniformly) makes it aperiodic, and the equilibrium distribution is then uniform over & m +\. The 
mixing time is known to be 8(TO 3 logm) (see Aldous [I], Diaconis and Saloff-Coste [14] and 
Wilson [3D]). 

More recently, Berestycki and Durrett [1] studied a continuous time version of this chain, 
denoted (X t m ) t >o, where multiplications by adjacent transpositions occur according to a Poisson 
process of rate 1. The connection with the chain described above is straightforward. The 
authors focussed on the inversion number D™ of X™, as a function of m and t. They established 
the convergence in probability of this random variable, suitably normalized, when t = t m scales 
with m in various ways. The limit is usually described in probabilistic terms, except in one 
simple regime, ra « i << m 3 , where D™/y/mi is shown to converge to Of course, the 

interesting regimes are those occurring before stationarity. 

This paper actually stems from a remark made in Berestycki & Durrett's paper. They quote a 
paper by Eriksen |18| . which gives a rather formidable expression for l mi „, the expected number 
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of inversions in 7P n ' : 



n -. 

V- 

^ m r 



r—l v 7 s— 1 

with 



r - 1 



^ \s - 1 



(-4) r s g s , m h s , m , (1) 



EE(- 



l) fe (m- 2f) 



fcO fc>0 

and 



2\s/2] - 1 
fa/21 +e + k{m + 1) 



, ( _ iy Y 2[S/2J 

U-/2J+J(m + 1) 

The authors underline that "it is far from obvious how to extract useful asymptotic from this 
formula". This observation motivated our interest in the expectation l TOj „. As the problem is of 
an algebraic nature, it should have much structure: would it be possible to find an alternative 
expression of \ m ,m with a neater structure than (JXJ) , and to derive asymptotic results from this 
expression? 

This is precisely what we do in this paper. Our alternative formula for I 
Theorem 1. Let m > 1. For k G Z, denote 

(2fc + iy . , ,4 

a ^ = ~^ r?T' Cfc=cosa fe , Sfc = sma!fc and x jk = 1 (1 

2m + 2 to 

T/je expected number of inversions after n adjacent transpositions in & m +i is 



m(m + 1) 1 x ^ (cj + Cfe 

4 8 (m+l)\ 4^„ _ 



,2 



k,j=0 

Equivalently, the generating function \ m (t) = X)n>o ^m,nt n is 

I /.s _ m (™ + 1) 1 ( c 3 + c fc) 2 1_ 

mtj " 4(1 -t) 8(to+1)^ o i_ tei4 - ^ 

This result is proved in Section^ and asymptotic results are derived in Sections[3]to[5]for three 
main regimes: linear (n m = O(to)) and before, cubic (n m = 0(to 3 )) and beyond, intermediate 
(to <C n m <C to 3 ). For the moment, let us give a few comments and variants on Theorem [TJ 

Limit behaviour. The chain tt^ has period 2, as w^ n ' is an even (resp. odd) permutation if 
n is even (resp. odd). But the sub-chains 7r' 2 ") and Tr^ 2n+1 > are aperiodic on their respective 
state spaces, the alternating group 2t TO +i (the group of even permutations) for 7r( 2 ™) and its 
complement 6 m +i \ 2l m +i for 7r^ 2n+1 \ Moreover, each of these chains is irreducible and sym- 
metric, and thus admits the uniform distribution as its equilibrium distribution. For m > 3, 
the average number of inversions of an element of 2l m +i is Jn(m + l)/4, and the same is true of 
elements of <5 m +i \ 2l TO +i. Thus when to > 3 is fixed and n — > oo, we expect \ m ,n to tend to 
to(to + l)/4. This can be seen on the above expression of \ m>n , upon observing that, for m > 3 
and < j,k < to with j + k ^ to, there holds Xjk G (—1,1). (The condition j + k ^ to is 
equivalent to Cj + c k ^ 0.) If m > 8, the stronger property € (0, 1) holds, which shows that 
\ m ,n is an increasing function of n. 

Eigenvalues of the transition matrix. Another consequence of the above theorem is that we 
have identified a quadratic number of eigenvalues of the transition matrix of the chain (see |18) 
for a detailed account of the connection between these eigenvalues and l m , n )- 

Corollary 2. Let c k = cos ^~T2 • The transition matrix of the adjacent transposition Markov 
chain on & m +i admits — among others — the following eigenvalues: 

4 

Xjk = 1 (1 - CjC k ), 

TO 
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for < j, k < m and j + k =/= m . 

This result is not really new: as pointed to us by David Wilson, one can derive from the proof 
of Lemma 9 in [30] that for < p, q < to, with p =/= q, 

pir qir 
1 2 — cos cos ■ 



1 m + 1 

is also an eigenvalue. This collection is larger than the one we have exhibited, as can be seen 
using 

o (k-j)it (k+j + l)n 

2cjCk — cos h cos . 

to + 1 to + 1 

The transition matrix has still more eigenvalues, for instance —1, with eigenvector {£(c))aee m +i 
where e denotes the signature. The paper [17] . originally motivated by a coding problem, gives 
a linear number of eigenvalues of the form i/m, where i is an integer. There exists a description 
of all eigenvalues in terms of the characters of 6 m +i (see [151 Thin. 3]). This description is 
valid in a much more general framework, and it seems that the complete list of eigenvalues is 
not explicitly known. 

Rationality of l m (t). That the series \ m {t) is always rational should not be a surprise: this 
property is clear when considering the transition matrix of the chain. Here are the first few 
values: 

h{t) 

U(t) 



(l-*)(l+i)' 

t(2 + 1) 
(l-t)(2-t)(l + *)' 

3^(27 + 9t - 7t 2 -t 3 ) 
(1 - t)(9 + 6t - t 2 )(9 - 6t - t 2 ) ' 
t (256 - 192 1 - 48 1 2 + 44 1 3 - 5 1 4 ) 
(1 - t)(16 - 5t 2 )(16 - 20i + 5t 2 ) ' 



A related aperiodic chain. If we consider instead the aperiodic variant of the chain (obtained 
by choosing 7r(™ +1 ) = 7f( n ) with probability l/(m + 1) and otherwise multiplying by a transposi- 
tion Si chosen uniformly), it is easy to see that the expected number of inversions after n steps 
is now 

n s \ j. 

\ m ,n=J2[ J( m+ l)n ^ 
k=0 v 7 v ; 

so that the associated generating function is also rational: 

m() ■ m - n " 1 - 1 1 (to + 1) m I m + 1 -t 

n>0 I \ ' I \ 

More generally, if a transposition occurs with probability p, the generating function of the 
expected number of inversions is 

1 , ( tP 



l-t(l-p) m \l-t(l-p) 

(Above, p = to/ (to + 1), but in [30] for instance, p = 1/2.) 

Alternative expressions. Theorem Q] gives a partial fraction expansion of \ m (t). One of the 
advantages of §3§ is that the coefficients involved in the sum over j and k are negative, which 
makes the list of poles of l m (i) clear (we have actually already used this to state Corollary [2]) . 
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A number of variations are possible, and we will use some of them in the asymptotic study of 
the numbers l mi „. For instance: 

I m m(m + l) _ 1 ^ Cj + c fc l_ 

m[ ' 4(1 - t) 8(m + l)^ 2^ (1 - Ci )(l - c k ) 1 - 1 ) 

1 V Cj " + ° k ( — - 1 ^ f 5 l 

(m + 1)2 (1 _ c . )(1 _ Ck) y ! _ t ! _ tejfc ; ■ 

Both formulas are proved in Section [5] 



2. The expected number of inversions 

We prove in this section Theorem Q] and its variants (J^HSJ) . Our starting point is a functional 
equation satisfied by a series related to l m (i). We then solve this equation using the bivariate 
kernel method, which has proved successful in the past few years in various enumerative problems 
related to lattice paths, permutations, and other combinatorial objects [JJ [HI O EH [231 E3 
I31j . The equation under consideration here can be interpreted in terms of weighted lattice 
paths confined to a triangular portion of the square lattice. Another problem of walks in a 
triangle, also related to the adjacent transposition Markov chain, was studied by Wilson j3D] via 
a diagonalization of the adjacency matrix. The enumeration of (unweighted) walks in a triangle 
was performed by Flajolet |21| using the reflection principle, in connection with a problem of 
storage raised by Knuth Ex. 2.2.2.13]. 



2.1. A FUNCTIONAL EQUATION 

For 7r € © m +i, let us write 7r = ixq-k\ ■ ■ ■ ir m if ir(i) = iTi for all i. For n > and < i < j < m, 

let 

(™) TTD/ M (n) s 

Then the expected number of inversions in is 



E 



(6) 



0<i<j<m 

Examining how the numbers p\ '■ may change at the rfi 1 step gives a recurrence relation for 
these numbers, first obtained by Eriksson et al. |20| . As shown by the lemma below, it converts 
our problem into the study of weighted walks confined to a triangular region of the square 
lattice. Consider the subgraph G m of the square lattice ZxZ induced by the points with 
< i < j < m (Figure [TJ. We use the notation •<-» (k, t) to mean that the points and 
(fc, £) are adjacent in this graph. 



(to - 1) 



1 (to-1) 
Figure 1. The graph G m . 
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Lemma 3 f|20|). The inversion probabilities p\^ are characterized by the following recursion: 



p.| ' = for < i < j < m, 



and for n > 0, 

(M)<-K».i) 

where 6ij = 1 if i = j and otherwise. 

As often, it is convenient to handle the numbers p^ n J via their generating function: 

P(t;u,v)=P(u,v):=J2t n E P^^. 

n>0 0<i<j<m 

Multiplying the above recursion by t n+1 , and then summing over n, gives the following functional 
equation for P(u,v). 

Corollary 4. The series P(u, v) satisfies 

1 — t H — -(4 — u — u — v — v)) P(u,v) ~ 
m J 

t / 1 — v m v m \ 

~ 71 vr^T " ( fi " " ( v ~ l)* m - l Pt(u) (u + v)P d {uv) , (7) 

m \ (1 — — t) J 

where u = l/u, v — 1/v, and the series Pi, P t and P d describe the numbers p^ n ) on the three 
borders (left, top, and diagonal) of the graph G rn : 

p M = E* n E pS3«*. 

n>0 0<i<m 

f 

n>0 0<i<m 

(n) 

f 

n>Q 0<i<m 

In view of (|6]), the generating function we are interested in is 

L(t) = ^i™/ = p (M), 

n>0 

which, according to the functional equation Q, may be rewritten 

t 2tP d (l) 

In the next subsection, we solve ([7|l. at least to the point where we obtain a closed form expression 
of -Pcz(l), and hence a closed form expression of l m (t), as announced in Theorem [TJ 

2.2. Solution of the functional equation 

We first establish a symmetry property of the series P(u,v). 
Lemma 5. The series P(u,v) satisfies 

P(u,v) = i^-V^P^u) 
uwi/i tZ = l/u, V = l/u. In particular, the "diagonal" generating function Pd{u) satisfies 

P d (u) = u^Paiu). (10) 



p t (u) = E <n E n, m -x*, 

n>0 0<i<m 

Pd{u) = E fl E pm" 1 - (s) 
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Proof. This can be derived from the functional equation satisfied by P(u,v), but we prefer to 
give a combinatorial (or probabilistic) argument. 

Let r be the permutation of © m +i that sends k to m — k for all k. Note that r is an 
involution. Let $ denote the conjugacy by r: that is, $(<t) = tut. Of course, $(id) = id and 
$(<tct') = $(ct)$(ct'). Also, $ has a simple description in terms of the diagram of a. in which cr(i) 
is plotted against i: the diagram of $(<t) is obtained by applying a rotation of 180 degrees to the 
diagram of a. In particular, with Si = 1), one has $(si) = s m _i_; for < i < m. These 

properties imply that the sequence of random permutations (<J?(7P )), ^(Tr' 1 )), $(7r' 2 ^), . . .) follows 
the same law as the original Markov chain tt^\ tt^ , . . .). In particular, for < i < j < m, 

pQ = p(*(*<»>), > = v^Si-i > =^-!, ro -i-i- 

This is equivalent to the first statement of the lemma. The second follows by specialization. B 

The next ingredient in our solution is the "obstinate" kernel method of fn 181 lili ntJl 1251 1251 131) . 

The kernel of the functional equation ([7]) is the coefficient of P(u,v), namely 

K(u, v) = 1 — t H (4 — u — u — v — v). 

m 

Let ([/, V) be a pair of Laurent series in t that cancel the kernel: K(U, V) = 0. The series 
P(U, V) is well-defined (because P(u, v) is a polynomial in u and u). Setting u = U and v = V 
in ([7]) cancels the left-hand side, and thus the right-hand side. That is, denoting as usually 
U = l/U and V=l/V, 

V m - l {l-U m V m ) V m ~ 1 Pi(V) P t {U) V m - 1 (U + V) (TnA _ n 



(i - uv)(u - i)(v - - 1) v-i u-i (u-i)(v-iy 

provided U ^ 1 and V ^ 1. Let us now exploit the symmetries of the kernel: obviously, K(u, v) 
is invariant by the transformations u i— >■ m and w i-> u. Hence the pairs (J7, V), (£7, V) and ({/, V") 
also cancel K, and it follows that 

T?m-1 d./t/^ r> tfr\ f/m—Xftr i i>\ 

p d (tno = o, 

p rf (c/y) = o, 





v m - 1 (i 


_ jjmym\ 




V m - l Pt(V) 


Pt(U) 


y ra- 




■v) 


(1- 


UV){U- 


l)(V-l)(l 


-t) 


V -I 


U - 1 


il! - 




-i) 






- u m v m ) 






Pt(U) 


y ra- 




-v) 


(1- 


UV){U- 


l)(V-l)(l 


-t) 


V -I 


U - 1 


il! - 




-i) 










v m - 1 P i (y) 


Pt(U) 


y m- 


- x (cM 


■v) 


(1- 


UV)(U- 


1)(F-1)(1 


-*) 


V-l 


U-l 


(17- 


i)(y 


-i) 



p rf ([/y) = o. 

Let us form the alternating sum of the four previous equations: all occurrences of Pi and Pt 
vanish, and, using (fTDj) . one obtains, after multiplying by U m V m (l — U)(l — V): 



(UV + 1) (U m+1 + V m+1 ) P d (UV) + V m ~ x (U + V) (1 + u m+1 V m+1 ) P d (uv) 

UV ( (1 - U m V m ) (U m + V rn ) (U m - V rn ) (1 + U m V m ) 



l-t\ 1-UV v-u 



(11) 



as soon as K(U, V) = 0, U ± 1 and V ± 1. 

This equation involves only one unknown series, namely P^. So far, the series [/ and V are 
coupled by a single condition, if ([/, V) = 0. Let g be a complex root of q m+1 = — 1, and let us 
add a second constraint on the pair (U, V) by requiring that U = qV. That is, V must be a root 
of 

K{qV, V) = 1 - t + — (4 - (1 + q)(V + qVj) = (12) 
m 

where q = 1/q and V = 1/V. We further assume q ^ — 1 (otherwise K(qv,v) is independent of 
u). Then V ^ 1, U = qV ^ 1, and the first term of (fTT|) vanishes. We thus obtain an explicit 
expression of Pd(q), which we write below in terms of V and its conjugate root V = qV . 
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Lemma 6. Let q ^ — 1 satisfy q m+1 = —1, and let V = V(t) and V' = V'(t) be the two roots 
of ([H]). Then VV = q = l/q and 

q ( V m + V' m 1 - q V m - V' m 

p d(q) = 



(l -t)(v m+1 + v ,m+1 ) V i-g i + g V-V 

The series V and V' are algebraic (of degree 2) over C(t). But their symmetric functions 
are rational, and thus Pd{q) is rational too, as expected. The following lemma gives an explicit 
rational expression of Pd(q)- 

Lemma 7. Lei <? ^ —1 satisfy q m+L = —1. Assume q = q k := e "+ 1 . TTien 

2it ^ (c fc + Cj)(l - CfeCj) 



g fc (l - <)m(m + l)s fe ^ l-tx jk 

with 



(2j + l)?r (2j + 1)tt , , 4 . 

c, = cos — — , s, = sm — — and x-j k = 1 (1 — c,c/.). 

J 2m + 2 ' J 2m + 2 J m V J ' 



Equivalently, 



2q k (l-t)s k 2q k (m + l)s k ^1-tXjk 



Proof. We will establish a closed form expression for the coefficient of t n in (1 — t)Pd(q), which 
is clearly equivalent to the first expression of Pd(q) given above: for n > 1, 



a n := [t n ](l-t)P d {q) 
2i 



— 5^(c fc + <v)(l-pfc<y) x jk n l . (13) 



qm(m + l)s k 

In order to obtain this expression, we begin with applying Cauchy's formula to the expression 
of (1 — t)Pd(q) given in Lemma|51 Let V be the root of (fT2"]) that vanishes at t = 0. (The other 
root V = has a term in 0(l/i) in its expansion.) Then 



where the integral is taken along a small circle around the origin, in counterclockwise direction, 

= g Z^+jPT W t- m - (^) r 

W 7j m+1 + (g-D) m+1 V l-q l + q v - qv 

and v = l/v. Obviously, = R(qv). By (fT2")) . 

1 4 l + o, 

- = i-- + — -(v + qV) 

t mm 

so that 

771 K 

Thus the integral expression of a„ reads 

o„ = -!- / S(v) — 

where the integral is taken along a small circle around the origin, in counterclockwise direction, 
and 

S(v) = -^-^-(v - qv) ( 1 - — + ii^(?> + qv)) R{v) 
m \ m m I 
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satisfies S(qv) = —S(v). Note that the only possible poles of S(v)/v are and the (2m+2)th roots 
of unity (because q m+1 = — 1). Thus a n is simply the residue of S{v)/v at v = 0. Performing 
the change of variables v = qw, with w = 1/w, gives 



S{w) — 
w 



2in 



where the integral is now taken along a large circle around the origin, in clockwise direction. 
This integral thus collects (up to a sign) all residues of S{v)/v. The residue formula thus gives: 

2 ^ [ S( + ^ [ S( 

2iir I,, w 2in r % v 



V - ResJS) 



v 

„2 m+ 2 = 1 



E p w ( 14 ) 



2to + 2 

„2 m + 2 = 1 

where 

P(w) = (1 - w 2m+2 )S{w) = 

1 + q m+u ,fw m + (qw) m 1 - q w m - {qw) m \ / 4 1 + g. , x 

-w m+1 (w - qw) 1 v ; + - 1 + -(w + qw) 

m \ 1 — q 1 + q w — qw J \ m m 

Take v = e la with a = l<n/(m + 1) and < £ < 2m + 2 and recall that q = e 10 with 9 = {2 ^" ■ 
Then 

1 + q = 2e- l9/2 cos(0/2), 
1-q = -2ie ie/2 sin(6»/2), 
1-q = 2ie~ ,e/2 sm(9/2), 
1 + q = 2e lf)/2 cos(9/2), 

v m+1 = (-if, 

v-qv = 2ie-' 9/2 sin(a + 9/2), 

v + qv = 2e- l6/2 cos(a + 9/2), 
v m + (qv) m = 2i{-l) l+1 e w ' 2 sm{a + e/2), 
v m -(qv) m = 2(-l)V e / 2 cos(a + (9/2). 
Putting these identities together, one obtains 

An 

p (v) = : — 7777777 (cos((9/2) sin 2 (a + 9/2) + sin 2 ((9/2) cos(a + 9/2)) 

qmsm{9/2) 

x [ 1 h — cos(0/2) cos(a + 9/2) 

\ m m 

or, with the notation of the lemma, 

Ai 

P(v) = { Cj + Cfc)(l - c 3 c k ) x jk n 

qms k 

with j = k + I. Returning to (fT4")l now gives 



E {Ck + Cj)(l - CkCj) x jk 



n-1 



qm(m + l)sk . 

v ' j — —m — l 

which is equivalent to (|13|) . upon noting that Cj = C-j-\. The first expression of Pd{q) given in 
the lemma follows. 
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For the second expression, we simply perform a partial fraction expansion in the variable t, 
based on 

t 1/1 1 



(15) 



(l-t)(l-xt) l-x\l-t 1-xt 
and use 

m 

E c ^ = 

(this identity follows for instance from c rn -j = — Cj). ■ 

Recall that Pd(u) is a polynomial in u of degree m — 1. The above lemma gives its values at 
m, or even m + 1, distinct points. Thus Pd{u) is completely determined by these values, and we 
can recover it by interpolation. We use a version of Lagrange's interpolation that is well-suited 
to symmetric polynomials. 

Lemma 8. Let P(u) be a polynomial of degree m — 1 with coefficients in some field containing 
C. Assume P(u) is symmetric. That is, P(u) = u m ~ 1 P(u), with u = 1/u. Let £ = [ "' 2 ~ 1 J , and 
let qo, . . . ,qi be distinct elements of C such that q 3 q k ^ 1 for all k and j . Then 

K) v ' i(i+ft)^^fe-?i)fe-i/?i)' y> 

where Xm,o = 1 if m is even, and otherwise. 

When q k = e t9k with 9 k = ( 2fc + 1 ) 7r £/jj s can fr e rewritten as 

2 * i m+i , nV^n/ \ qksm9 k 



p (») = -^(« w+1 + 1 )E i, <« 



fc=0 



In particular, with the notation ([2)1, 

2« r>/ \ *Cfc 



p(i) = -^-^py 

m + 1 f— ' 



Proof. For the first part, it suffices to observe that the expression given on the right-hand side 
of (Unj) has degree m — 1, and takes the same values as P(u) at the 2f + 2 distinct points 
qo, . . . ,q#,l/qo, ■ ■ ■ ,1/qe, and also at —1 if m is even. This gives a total of m + 1 correct values, 
which is more than enough to determine a polynomial of degree m — 1 . 

For the second part, we observe that qo, . . . , l/cft)> . . . , 1/qe, together with —1 if m is even, 
are the m + l roots of u m+1 + 1. Hence 

(H.i»n(-.»-*i- ( ,_„,„_, ;rf 

which, in the limit u — > ^ , gives 

m + 1 



(1 + q k )X<n.o Y[ (q k - q 3 ){q k - l/q 3 ) = - 



Qk(Qk - l/qk) 

The second result follows. The third one is obtained by setting u = 1, and noticing that 
sin6 k = 2c k s k , while 1 — cos9 k = 2s 2 ,. I 

We can finally combine this interpolation formula with Lemma [7] to obtain an explicit expres- 
sion of Pd(u). As we are mostly interested in Pd(l) (see ©), we give only this value. 
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Proposition 9. Let £ — [ m 2 1 j , and adopt the notation @. Then the series Pd(u) defined 
by (0) satisfies: 



. m 1 x - x Ck ( 



Ck c 7 - + Cfc 



_^ mm 

"Tip EE 



+ c fe ) 2 (l - CjCfc) 1 



2(1 -i) 4(m+l)*£^ ^ l-te ifc 

Proof. We apply Lemma [5] to the second expression of Pd{qk) obtained in Lemma [7] This gives 
the first expression of Pd(l) above, provided 

£4 = e*i±i). (17) 

The latter identity is obtained by applying Lemma [5] to P(u) = 1 + u + • ■ ■ + ti m_1 = (1 — 

M m )/(l-lt). 

We now seek a symmetric formula in j and fc. Using c m ^j — —Cj, c m -k = — Cfe, s m _t = s& 
and a^fc = x m -j, m -k, we write 

' ~ i £ m 

Cfc Cj ~r C/c x x C-m — k Cm—j > C m ^k 



\ - \ - (-k Cj -t- Ck \ " \ " 

^ ^ S? 1 - tXik ^ ^ 



fcO^O^ 1 ^ k=0j=0 ™~ k lX m-].ra-k 

Ck Cj + C k 



m m 



^ s? 1 



k—m—l 3=0 K 



-.mm 

2 ^ ^ si 1 



fc=0 j=0 



m m / \ 



tXjk 



and this gives the second expression of Pd(l). 



Proof of Theorem^ Let us now return to the generating function \ m {t) whose coefficients 
give the expected number of inversions. It is related to Pd(l) by ©. Theorem [T] is obtained 
by combining the second expression of Proposition EH a partial fraction expansion in t (based 
on (|15p ). and finally the identity 



E ^|^ = 2m(m + l) 3 . (18) 

3 ' 



Si Si. 
,k=0 1 k 



To prove this identity, we write 

(cj +c k ) 2 



V WJL^l - V -1— ± fasr --r ands 

j,k=0 Ik jik=0 b j b k 



y j=0 "3 j \k=0 k / 

(3. 

Ci 



,j=0 S 3 I k=0 V 



and complete the proof thanks to (TIT)) 
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To obtain the expression (jH) of \ m (t), we write 

(Cj+Ck) 1 1 / Cj + Cfc Cj + Cfc 

.2^ Q S 2 S 2 1 _ tXjk 2 2^ I.! I Cj )(l - Cfc) (1 + C,)(l + Cfc) ; 1 - te ifc 

C; + C fc 



:0 (l-c,)(l-c fc ) 1 — tejjfe " 

The latter identity follows from replacing j by m — j and fc by m — k. 
Let us now extract the coefficient of t° in ((4]). This gives 

_ m(m + 1) ^ Cj + Cfc 



1 V Cj + Ck (19) 

+ ipA(l-c)(l-a)' 1 j 



4 8(m+ i)2 z^ (1 _ c . )(1 _ Cfc 

and the expression of \ m (t) follows. 

3. Small times: linear and before 

When m is fixed and n — > oo, the asymptotic behaviour of the numbers l m! „ is easily derived 
from Theorem [TJ as sketched just after the statement of this theorem. For m > 3, 

m(m + 1) 

/m,n = — ^ " + O(x 00 n ) 

where .too = l -^sin 2 

In this section and the next two ones, we consider the case where n = n m depends on m, and 
m — > oo. As in |4], three main regimes appear: linear (n m = 0(m))), cubic (n m = 0(m 3 )) and 
intermediate (m <C n m <C to 3 ). This can be partly explained using the following simple bounds. 

Lemma 10. For m > 3 and n > 0, there holds 

m(m + l) m(m + 1) c 2 „ 

(1 - X 0Q ) < \ mn < — , 9 4 Too • 

4 4 2(m + 1) sg 

In particular, if n = n m and m — > oo, 

m(m+l) , 2 n . 
Im,n = — ^ " - <d{m 2 x 00 n ). 

Proof. These inequalities follow from the expression of l m „ given in Theorem [TJ The upper 
bound is obtained by retaining only, in the sum over j and k, the term obtained for j = k = 0. 
The lower bound follows from \Xjk\ < %oo an d (| . ■ 

Observe that Too = 1 — 0(l/m 3 ). The lower bound on \ m>n already shows that if n 3> to 3 , 
then Too™ = o(l) and l mi „ ~ m (™+ 1 ) _ Also, if n ~ kto 3 for some k > 0, then Too™ ~ a f° r some 
Q' € (0, 1): then l m>n is still quadratic in m, but the upper bound shows that the ratio \ m , n /m 
will be less than 1/4. The other regimes correspond to cases where n -C to 3 . 

This section is devoted to the linear (and sub-linear) regime. We first state our results, and 
then comment on their meaning. 

Proposition 11. Assume n = n m = o(m). Then 

^ = 1 + Ofn/m). 
n 

Assume n = n m = 0(m). That is, K\m < n m < Kim for two positive constants K\ and k^. 
Then 

^ = f{n/m) + 0(l/m) 
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where 

1 r l-exp(-8^ 2 /(l+t 2 )) ^ 

= *rj W^) dt 

j>0 ; 

Comment. When n m is sub-linear, the inversion number equals its largest possible value, n, 
with high probability. For instance, liooo.io — 9.9. When n m grows linearly with m, the expected 
inversion number is still linear in n, but with a ratio /(k), where k — n/m. This ratio decreases 
from f(0) = 1 to /(oo) = as k increases from to oo. The fact that /(0) = 1 is consistent 
with the sub-linear result. 

Note that for a related continuous time chain, with inversion number D^, it has been proved 
that when t = nm, the random variable D^Jt converges in probability to a function of t described 
in probabilistic terms 0]. 

Proof. The starting point of both results is the following expression of \ m ,n, which corresponds 
to ©: 

'"*■« = zr Z~n2 E n ~ \ ( l ( l ~^ - ■ ( 20 ) 

8(m + 1) 2 (1 - Cj )(l - c k ) V V m v / / 

K,J— V 



Assume n m = o(m). Then 



uniformly in j and fc. Thus 



4 V 4n 

l--(l-c,. Cfc ) =!- (!- c . Cfe ) + (l- CiCfc )2 ( 



■ » {cj + c k ){l - CjC k ) 1 \ - (1-CjCfc) 2 n f n2 \ 

™.» 2m(m + l) 2 t tn (l-9)(l-c fc ) + 4(m + l) 2 ^ (1 - Cj )(l - c fc ) UV 



k,j=0 v J/v ' v ' k,j=0 

(we have bounded \cj + Ck\ by 2) and the result follows using 

(Cj + Cfc) (l - CjCjfe) 
(1 - Cj 

and 

(1 - CjCfc) 



E ( 1 t -~)rc,r (2 " +1 ' ( " +1 ''- (21) 

To prove these two identities, one may start from the following "basic" identity: 

(m + l) 2 , (22) 



y— 



which follows for instance from (11911 . 

• Assume now n m = O(m) and denote k = n/m. Then 



Thus 



(l - ^(1 - CjCk^j = cxp (-4/s(l - Cj-cjfc)) (1 + (1 - c jCfe ) 2 0(n/TO 2 )) . 
1 m c + c 

u - = s^TTp E ;: (i4)(i-c fc ) (1 ~ exp ( ~ 4k(1 _ C]Ck))) 

+ ^TTV E ^^f^fexp(-4 K (l-c jCfe ))0(n/m 2 ). 
8(m + l) 2 ^ o (1-Cj)(l-Cfc) 
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By (f2"Tjl , the absolute value of the second term above is bounded by 

i^smp ? (i-ln-«) oln/m ' ) " ° ( "" n} " 

k.j — 

Recall that Cj = cos 2m+T • Hence the first term in the expression of l rn .„ looks very much like 
a (double) Riemann sum, but one must be careful, as the integral 

r r cosx + cosy 

/ / t; tt; -(l-exp(-4K(l-coszcosy)))cfedy 

Jo Jo (1 -cos a;) (1 -cosy) 

diverges (the integrand behaves like l/x around x = 0, and like 1/y around y = 0). Let us thus 
write 

1 - exp (-4«(1 - CjCfc)) = [1 - cxp (-4k(1 - Cj))] + [1 - cxp (-4k(1 - c fe ))] 

— [1 — cxp (— 4k(1 — Cj)) — exp (— 4k(1 — Cfc)) + cxp (— 4k(1 — CjCfc))] , 

so that 

l m ,„ = 25i - S 2 + 0(1) 

with 

^ ^ 8^ £, (I-V-O) " - ° X1> '- 4K(1 - Cj))) 



and 

„ = 1 \ - + c fc 

" (m+ l )2 ^ o(1 _ c . )( l_ Cfc) 

x [1 - cxp (— 4«(1 - Cj)) - cxp (— 4k(1 — c fc )) + cxp (— 4«(1 — CjCfc))] 



The first sum reads 

1 1 ~ exp (~4k(1 ~ c j)) £z + Cfc 

1 8(m+l) 2 ^ 1-Cj ^ 1-Cfc 

v ; 3=0 J fc=0 



as 



1 1 ~ ex P (~4k(1 — Cj)) . 1 ^ 1 -exp (-4k(1 -Cj)) 

8^ 1-c, 1 jj 8(m+l)^ 1-c, 

3=0 3 v ; 3=0 3 

= (to + l) 2 and Cfc = to(to + 1) 

1 — Cfc ^— ' 1 — Cfc 

fc=o fe=o 



see (|22p). Both terms in Si are now bona fide Riemann sums. More precisely. 

4k(1 — cos a;)) 

-r = / — 

■J 

uniformly in ft £ [ki,K2], and similarly 

1 - cxp (— 4«(1 - Cj)) 



n ^ 1 - exp (-4k(1 - Cj)) , 1 + c \ = H- exp(- 

TO + 1 ^ 1 - Cj J CI 1 ~ 

3=0 



(1 + cosx)cZx + 0(l/m) 



L_ l-exp(-4K(l-c,-)) = 



TO , 

J=0 



Thus 



to f 1 — exp(— 4/t(l — cosa;)) ,. . , 

Si = — / — -(l + cosa;)efo + 0(l). 

57T ./n 1 — COS X 



Similarly, S2 is a (double) Riemann sum associated with a converging integral, and is O(l). Thus 

^, x ft f* 1 — ex p(— 4k (1 — cosx)) , . , „,„ N 

l m „ = 2Si + 0(1) = - — / ^ i ^ 1 + cos .t) dx + 0(1). 

4ttk Jr. 1 - cos X 
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The integral can be rewritten by setting t = tan(x/2), and this gives the second result of the 
proposition, with the integral expression of /(«;). The expansion of /(k) in k is then routine. B 

4. Large times: cubic and beyond 
Proposition 12. If n = n m ^s> m 3 , then 

m 2 4' 

Assume n = n rn = 0(to 3 ). That is, Kim 3 < n rn < K2m 3 for two positive constants K\ and K2- 
Then 

— ~ g(n/m ) 

where 

1 i 6 / e -- 2 (2j+i)72 ' 



Comment. When n ^> m 3 , the inversion number equals, at least at first order, its "limit" value 

T 



m(m+i) wmc j 1 j g average number of inversions in a permutation of & m +i taken uniformly at 



random. When n = 0(m 3 ), the inversion number is still quadratic in m, but with a ratio g(n), 
where k = njm . This ratio increases from to 1/4 as k goes from to oo, which makes this 
result consistent with the super-cubic regime. Note that for a related continuous time chain, 
with inversion number D^, it has been proved that when t = Km 3 , the random variable D^/m 2 
converges in probability to a function of t described in probabilistic terms [3]. 

As said above, l m n ~ m 2 /4 as soon as n 3> m 3 . Whether the next term in the expansion 
of \ m ,n is exact, that is, equal to to/4, depends on how n compares to to 3 log m, as shown by 
the following proposition. In particular, it clarifies when \ m , n = + o(m): this happens 

beyond n ~ 1/7t 2 to 3 logm. 



Proposition 13. Let c > 0, and n = n m ~ cm 3 logm. For all e > 

m(w + 1} 0(to 2 +— 2 ) = l m ,„ < m(m + 1) - 6(m 2 — 2 



4 

Thus if c < l/ir 2 , there exists 7 > such that 



m{m + 1) 



while if c> l/ir 2 , there exists 7 > such that 



- 0(m L+r ), 



For the critical value c = l/ir 2 , the following refined estimate holds: ifn = n rn ~ 1 /ir 2 m 3 log m + 
am + o{m 3 ), then 

^ = ^^-^-^(1 + 0(1)). 

4 7T 

Proof of Proposition 1121 The first result is a direct consequence of Lemma I1Q[ given that 
.too = 1-0(1/to 3 ). 

Assume now that n m = 0(to 3 ). We start from the following expression of l mj „, which corre- 
sponds to (jH): 

to(to + 1) 1 y> Cj+Cfe / 4 A" - . 

4 8(to + 1) 2 fe ^ o (1 - Cj )(i - c fc ) v "i y 
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Let M = M m be an integer sequence that tends to infinity in such a way M m = o(y / m). We 
split the sum over j and k into two parts: j < M, k < M in the first part, j > M or k > M in 
the second part. Let us prove that the second part can be neglected. We have: 



I SUM 



21 : = 



E 

j or k>M 



Cj + c fe 



(1 - Cj)(l - Cfe) 



x jk 



< 



2 1 1 

By (|22p . the sum over fc is 0(m 2 ). Moreover, the sum over j is a Riemann sum, and the function 
x n- 1/(1 — cos a;) is decreasing between and tt, so that 



m 1 ii/- 

El m + 1 / 

1—Cj~ 7T /( 



c/.c 



m + 1 



(2AJ+l)x 1 — COS X 

2TTI + 2 



7r tan 



(2A/+1)tt 
4m+4 



= o 



m 
17 



This gives 



SUM 2 = O 



m 
~M 



= o(m 4 ). 



Let us now focus on small values of j and k. The following estimates hold uniformly in j and 
k, when < j,k < M : 

128(m+l) 4 /lf2 ' 



Cfe 



(1 - Cj)(l - Cfe) 



1 (1 - CjCfe) 

m 



1 + 



\ m z 



(2j + l) 2 (2fc+l) 2 7T 4 

128m 4 

(2j + l)'(2fc+l)^ (1 + ° (1)) ' 

n^ 2 ((2j + l) 2 + (2fc + l) 2 ) 

2m(m + l) 2 
n^ 2 ((2j + l) 2 + (2fc + l) 2 ) 



(24) 



cxp 



M 4 
1 + n — 



(25) 



cxp 



2m 3 



We have used the fact that n = 0(m 3 ) and M = o( v / ro). Hence, 



SUMi 



Sir 



Cj + Cfe 



(l-c,)(l-c fe ) 

k,j=0 y ]n ' 



128m 4 



M 

E 



(2j + l) 2 



(l + o(l)) 



Finally, observe that 

M 



E 

3=0 



(2J + 1) 2 



= h 



0(e" 



= ft 



(l + o(l)) 



where 



Writing 



M 



ft(«)=E- 



-K7T 2 (2j"+l) 2 /2 



m(m +1) 1 



(2j + l) 2 

2 (SUM! + SUM 2 ) 



(m+ 1) 



then yields the result. 



16 



MIREILLE BOUSQUET-MELOU 



Proof of Proposition 1131 The first result follows from the second identity of Lemma 1 101 using 



n = cm 3 logm(l + o(l)) and log.T o = =-(1 + 0(l/m)). 

mr 

If c < l/7r 2 , we choose e such that 7 := 1 — cir 2 — e > 0, and the second result follows. If c > 1/ir 2 , 
we choose e such that 7 := C7r 2 — 1 — e > 0, and the third result follows. 

Assume now n = l/7r 2 m 3 logm + am 3 + o(m 3 ). We will show that the sum over j and k 
occurring in (|23p is dominated by the contribution obtained for j = k = 0. This contribution 
reads 

2c n „ 128m 3 



(1 - co) 



r^oo 



-e— (l + o(l)). 



The contribution obtained for j = k = in is 

2c 



■xoo" = 0(l/m). 



(1 + co) 2 

Finally, for all other values of j and k, there holds < Xjk < xqi- Hence 



E 



Cj + Cfc 



0<fe,j<Tn 
(fe,3)^(0,0),(m,m) 



(1- Cj )(l-C fc ) 



Xjk 



< 2x 01 r 



y L 

a-* 1 



0<fcj<m 



(1- Cj )(l-C fe ) 



= 2(m+l) 4 a:oi" = 0(l/m). 



We have first used the identity (|2^|) . then n = 1/7T 2 ™ 3 logm + am 3 + o(m 3 ), and logxoi 
— 57r 2 /m 3 (l + o(l)). The estimate of \ m ,n follows. 



5. The intermediate regime 
Proposition 14. Assume m -C n m <C m 3 . Then, denoting n = n m , 

1 /¥ 

■ m.n / ^ 



Comment. By Propositions [Til and IT2l one has 

I m. . n 



1 run 



while 



It can be proved that 



y/~K,f{n) if n. 



if n m ~ rem 3 . 



lim \JHf{K,) = \ — = lim ^7=^, 

which makes all three regimes consistent. 

Note that for a related continuous time chain, with inversion number D l m , it has been proved 
that when m <C t <C m 3 , the random variable D\ n j\fmt converges in probability to \J2pn 0j. 

Proof. The proof mixes arguments that we have already used for small and large times. As in 
the small time case, we start from (f2"U|) . As in the large time case, we split the sum over j and 
k into two parts: j < M, k < M in the first part, j > M or k > M in the second part. Here, 
M = M m is an integer sequence that satisfies 

/m 3 fm\ V4 

< M < mm , m — 

n \ n \ n 

Such a sequence exists under the assumptions we have made on n. 
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We begin by bounding the second part. As in the large time case, one easily obtains 



|SUM 2 



j or k>M 



(1- Cj )(l-C k ) 



< 



y — 

^ „ l-Cw 1 



j or k>M 



(l- Cj .)(l-c fe ) 



m 
17 



(26) 



We now focus on the case j, k < M. The estimates ((24|) and ([25]) still hold uniformly in j and 
fc. This gives 



k,j=0 



128m 4 ^ ex P 



n7r 2 ((2j + l) 2 + (2fc+l) 2 ) 
2m3 



Using 



j,fc=0 



1 - e a+b = (1 - e a ) + (1 - e b ) - (1 - e a )(l - e") 



(2j + l) 2 (2fc + l) 2 



(l + o(l)) 



this may be rewritten as 



SUM! = ^ (2S M f: S M A (1 + o(l)) (27) 



with 



M 1 qrnf n7r 2 (2j+l) 2 \ 



.7=0 



(2J + 1) 2 



The sum Sm is close to a Riemann sum. More precisely 
Sm 



as 



-dx 



1 - e" 



-di (l + o(l)) = 0r + o(l). 



In particular, 5m = o(l). Returning to (j2"7) , let us finally note that 



M 1 7T 2 

^(2TTTF = T (1 + 0(1)) - 

fc=o v ; 



This gives 



Now recall that 



16 



SUMi = -= mf^mn(\ + o(l)) . 

V 27T 



I, 



■(SUMi+SUM 2 ). 



8(m+ l) 2 

Combining the above estimate of SUMi and the bound (f2"fj|) on SUM2 gives the announced 
estimate of \ m „. ■ 
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6. Perspectives 

Many interesting Markov chains on groups have been studied, and it is natural to ask to which 
similar problems the approach used in this note could be adapted. To make this question more 
precise, let us underline that such problems may involve changing the dynamics of the chain, 
changing the statistics under consideration, or changing the underlying group. 

Changing the chain. Already in the symmetric group & m +±, a number of different dy- 
namics have been considered. One can for instance multiply at each step by any transposi- 
tion [31 1151 128| , or consider cyclic permutations by allowing multiplications by the transposi- 
tions (0, 1), (1, 2), . . . , (m — 1, m) and (m, 0), or only allow transpositions with the largest ele- 
ment |221 114) . or perform block transpositions [6], or shuffle the elements in various ways [2], etc. 
See for instance |12) . 

Changing the statistics. We have focused in this paper on the inversion number. It is an 
interesting parameter from a probabilistic point of view, as it gives an indication of whether the 
chain may be mixed at time n or not. However, in certain biological contexts, it may be more 
sensible to estimate instead the natural "distance" between and n^ n ' , defined as the minimum 
number of chain steps that lead from one to the other [3J H]. For the chain studied here, this 
coincides with the inversion number, but for other dynamics it will be a different parameter. For 
instance, if we multiply by any transposition, one step suffices to go from 012 to 210, whereas 
the inversion number of 210 is 3. Clearly, the approach we have used here relies heavily on 
the possibility of describing in simple terms the evolution of the parameter under consideration 
(as we did by the combination of ^ and Lemma |3J . Note that an explicit expression for the 
expected distance after n (non-necessarily adjacent) transpositions has been given in |19j . 

Changing the group. Although many different chains in many different groups have been 
considered, we are primarily thinking of classical families of (finite or affine) Coxeter groups, 
because the inversion number admits in these groups a natural generalization (the length) which 
has usually a simple description [21 Chap. 8]. According to [18], an explicit formula, due to 
Troili, is already known for the average length of the product of n generators in the affine group 
A m . Another solved case is the hypercube |13) . 

Acknowledgements. The author thanks an anonymous referee, Niklas Eriksen and David 
Wilson for their suggestions, comments, and help with the bibliography. 
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